翻訳と辞書
Words near each other
・ Heap spraying
・ Heap Steep Glacier
・ Heap's algorithm
・ Heap's Rice Mill
・ Heap, Bury
・ Heapey
・ Heapey railway station
・ Heapham
・ Heaphy
・ Heaphy River
・ Heaphy Spur
・ Heaphy Tin Man
・ Heaphy Track
・ Heaps (surname)
・ Heaps Rock
Heaps' law
・ Heapsort
・ Hear & Now
・ Hear & Now (Billy Squier album)
・ Hear & Now (Don Cherry album)
・ Hear 'Em Rave
・ Hear 'n Aid
・ Hear (Diesel album)
・ Hear (disambiguation)
・ Hear and Now
・ Hear and Now (album)
・ Hear in the Now Frontier
・ Hear It Is
・ Hear It Now
・ Hear Me


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Heaps' law : ウィキペディア英語版
Heaps' law

In linguistics, Heaps' law (also called Herdan's law) is an empirical law which describes the number of distinct words in a document (or set of documents) as a function of the document length (so called type-token relation). It can be formulated as
: V_R(n) = Kn^\beta
where ''VR'' is the number of distinct words in an instance text of size ''n''. ''K'' and β are free parameters determined empirically. With English text corpora, typically ''K'' is between 10 and 100, and β is between 0.4 and 0.6.
The law is frequently attributed to Harold Stanley Heaps, but was originally discovered by .〔: "Herdan's law in linguistics and Heaps' law in information retrieval are different formulations of the same phenomenon".〕 Under mild assumptions, the Herdan–Heaps law is asymptotically equivalent to Zipf's law concerning the frequencies of individual words within a text.〔; ; .〕 This is a consequence of the fact that the type-token relation (in general) of a homogenous text can be derived from the distribution of its types.
Heaps' law means that as more instance text is gathered, there will be diminishing returns in terms of discovery of the full vocabulary from which the distinct terms are drawn.
It is interesting to note that Heaps' law also applies to situations in which the "vocabulary" is just some set of distinct types which are attributes of some collection of objects. For example, the objects could be people, and the types could be country of origin of the person. If persons are selected randomly (that is, we are not selecting based on country of origin), then Heaps' law says we will quickly have representatives from most countries (in proportion to their population) but it will become increasingly difficult to cover the entire set of countries by continuing this method of sampling.
==Notes==


抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Heaps' law」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.